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A closed-form expression for the capacity of an array of correlated Gaussian 
channels is derived. It is shown that when signal and noise are independent, the 
array of observables can be replaced with a single observable without diminishing the 
capacity of the array channel. Examples are provided to illustrate the dependence 
of channel capacity on noise correlation for two- and three-channel arrays. 


I. Introduction 

In this article, we formulate the framework to evaluate the channel capacity of an array system. We 
define the channel capacity of an array channel as the maximum of the mutual information between a 
singl e input source and an array of n output observables over all distributions on the input that satisfy 
certain constraints (e.g., power, bandwidth, discrete or continuous, etc.). We derive a closed-form general 
formula to the channel capacity of an array of n Gaussian channels. This formula applies to correlated and 
uncorrelated noise, as long as the Gaussian assumption holds and the second-order statistics (covariance) 
of the signal and noise sources can be characterized. This formula allows one to find the channel capacity 
of various array constellations and orientations in the presence of correlated and uncorrelated noise. Some 
of the interesting results that we observed are as follows: 


(1) When the noise sources are uncorrelated, the array channel capacity is equivalent to the 
channel capacity of a single Gaussian channel with a signal-to-noise ratio (SNR) equal 
to the sum of the SNRs of the individual channels. 

(2) The array channel capacity is lower bounded by the channel capacity of the channel with 
the highest SNR. 


II. Problem Formulation 

We consider transmitting a single complex source through n channels, as illustrated in Fig. 1. Let 
a x ,ij = EXiXf,crz,ij = EZiZ ■?, and o Y ,ij = EYjY J for 1 < i, j < n. Notice that o 2 xl , o X2 , • • • , o X n are 
the signal powers as seen by the receivers; cr| v cr| 2 , • • • , o\ n are the noise variances; and Oy x , Oy 2 , • • • , 
Oy n are the channel output variances. The array channel capacity is given by 
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CHANNEL 



C array — m&X I{Y \ , Y 2 , • • • , Y n , X ) 

p(x) 

= max{ff(Fi, r 2 , ■■■,Y n )-H(Y 1 ,Y 2 ,---,Y n \X)} 

p(x) 

= max{H(Y u Y 2 , ■ ■ ■ , Y n ) - F(Z 1; Z 2 , • • • , Z„)} (1) 

p(x) 

where we further assume a power constraint on X, and where the Xi are obtained from X by a deter- 
ministic operation on each i. The interpretation of this model is that signals in the various channels 
can have different magnitudes and phases, but that the differential delays between the waveforms are 
negligible, having been removed by delay compensation. Thus, the signal path differences between the 
various channels either are negligibly small, as in the case of array-feed or phased-array applications, or 
have been properly equalized, as in the case of antenna arrays. 


HI. Capacity of an Array of Gaussian Channels 

In the following derivation, we will use some results from Cover and Thomas [1]. Since Z T = 
(Zi, Z 2 , • • • , Z n ) is a complex Gaussian random vector, H(Z) is given by [1, Theorem 9.4T] 

H(Z) — - log 2 (27re) n |©z| bits/channel use (2) 

where ©z is the covariance matrix of Z, and |©z| is its determinant. Prom Theorem 9.6.5 of [1] and 
under the assumption that the input source is power constrained to P, the input source X that maximizes 
H(Yi,Y 2 , - • ■ ,Y n ) has a Gaussian distribution. The maximum mutual information and, therefore, the 
array channel capacity are achieved with a Gaussian source, and the output observables Yi,Y 2 , • • • ,Y n 
are correlated complex Gaussian variables. Using Theorem 9.4.1 of [1], H(Yi,Y 2 , ■ ■ ■ , Y n ) is given by 
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( 3 ) 


H(Y u Y 2 , ■ ■ ■ ,Y n ) = ilog 2 (27re) n |©y | bits/channel use 


where ©y is the covariance matrix of Yj, Y 2 , ■ ■ ■ , Y n . The array channel capacity is 


y array 


1 |0y 1 

— log 2 — — - bits/channel use 

2 |©z| 


( 4 ) 


This formula can also be found in [1, Eq. (10.98)]. Notice that this formula makes no assumption on the 
pairwise correlation between the signal Xi and the noise Zj. 

Next, we consider the problem from the viewpoint of communications theory and assume that the 
additive complex Gaussian noise Z_ is independent of the signal X. For the Gaussian channel, we let 
@x = S gt, where S T = (S u S 2 , ■ ■ • , S n ) is a deterministic vector with Y^i=i |S'i| 2 = P, and f is defined 
as the conjugate transpose of a vector (that is, = S* T ). Now the covariance matrix of the observables 
can be expressed as 


©y = Q x + © z = S + Q z 


( 5 ) 


since, in this case, ESiZj = 0 for 1 < i, j < n. With the above notation, Eq. (4) can be evaluated as 


C array — 


|5 5t + e z | 

lezl 


bits/channel use 


( 6 ) 


emphasizing the independent signal and noise components of the observable covariance matrix. Using a 
corollary to Theorem A.3.2 in [3], the ratio of determinants in Eq. (6) can be written as a quadratic form 
of the inverse noise covariance matrix and the signal vector as 


| S5t +© z | 
\*z\ 


= i +jgte£ 1 £ 


( 7 ) 


and, hence, the array capacity may be equivalently expressed as 

Carray = ~ log 2 (l + /^©y 1 /?) bits/channel use 


(8) 


While Eqs. (6) and (8) are mathematically equivalent, Eq. (8) is particularly important for the following 
reasons: First, it provides useful insights into the behavior of array capacity and, second, it suggests a 
simple receiver structure for processing the array observables. 


IV. Receiver Structure for an Array of Gaussian Channels 

Let w T = (wi,w 2 , ■ ■ ■ ,w n ) be a complex weight vector. It follows that the SNR of the scalar output 
v = w T Y (Y t = (Yi, Y 2 , • • • , Y n )) is given by 
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SNR = 


\w T S\ 2 

E(\w T Z\ 2 ) 


w T SS^w* 

wQzw* 


This expression holds for any weight vector. As shown in the Appendix, the weight vector w that 
maximizes the SNR is given by 


y^opt — a (®z 1 ^)* (10) 

where a is an arbitrary complex constant. Substituting Eq. (10) into Eq. (9) yields the optimal SNR 

SNR max = ste^S ( 11 ) 

which is seen to be the same as the quadratic form in Eq. (8). Thus, the quadratic form in Eq. (8) is 
equivalent to the maximum of the SNR obtained from an optimally weighted sum of the array observables. 
The array receiver structure implied by this observation is shown in Fig. 2. 



Fig. 2. Receiver structure derived from Eq. (8). 

It is well known that a Gaussian source achieves capacity for a Gaussian channel [1]. Since the output 
of the receiver in Fig. 2 is a weighted sum of the Xi plus Gaussian noise, it is a Gaussian random variable 
for any value of the source X : hence, the output is a Gaussian channel. Since the array capacity in Eq. (8) 
depends only on the maximum SNR of the output variable v, it follows that the capacity of the scalar 
channel of Fig. 2 equals the capacity of the array channel of Fig. 1. This is an important observation since 
it enables the conversion of an n-dimensional vector observable to a single-dimensional scalar observable 
without decreasing the capacity of the channel. 

Writing Eq. (8) in terms of Eq. (11) emphasizes the point that the channel capacity of the array 
depends only on the maximum SNR that can be achieved by a weighted sum of the array observables: 

Carroty = 2 ^°§2(1 T ^^moi) (12) 

It follows that the maximum of the mutual information between v and X can also be stated directly as 
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max{H{v) - H(v \X)} = C v = C array (13) 

p{x) 

which is simply the capacity C v of the scalar channel. 

When the components of the noise vector Z_ are uncorrelated, both the covariance matrix G z and 
its inverse Q^ 1 become diagonal matrices, with the diagonal element of the inverse matrix equal to the 
inverse of the corresponding diagonal element of the covariance matrix. With the ith diagonal element of 
Qz equal to g z % the maximized SNR becomes 


SNRma, = 5t Q^S = ^ ^ (14) 

i = 1 ^Z,i 

where the right-hand side of Eq. (14) is recognized as the sum of the individual channel SNRs. The array 
capacity follows directly as 

1 / " I 0.12 

Carray = X '°g2 1 + X/ 

1 \ i= i a z,i 

Thus, the capacity of an array of Gaussian channels with uncorrelated noise is equivalent to that of a 
single Gaussian channel, with an SNR equal to the sum of the individual channel SNRs. 

For the correlated noise case, the observation of the noise in any channel provides useful information 
about the noise in every other channel. This concept is called “noise cancellation” in the adaptive signal- 
processing literature. 


bits/channel use 


(15) 


V. Examples 

Some examples of simple array channels that allow closed-form analytic solutions and provide insights 
into the problem are considered. 

A. Two-Channel Array 

Consider a two-channel array with signal vector S_= {Si, S^) and noise covariance matrix 


©z = 


<7 


2 

Z,1 


P*GZ,1&Z,2 


PVZ,l°Z,2 

4,2 J 


(16) 


where EZiZ J = paz,i&z, 2 , E\Zi\ 2 — and F|Z 2 1 2 = <j 2 Z 2 - The determinant of the noise covariance 
matrix is |©z| = o'i,i fJ i, 2 ( 1 “ |p| 2 ); its inverse is given by 


0^ = 


\P I 2 


"z, i 
-P* 

L Gz ,lCZ ,2 


GZ,\Gz,2 

1 

4,2 J 


(17) 


and the resulting array channel capacity is 
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(18) 


C, 


array 


; l°g2 1 + 


|5 X | 2 , |S 2 | 2 2 Re{pSlS 2 ) 


i - IpI 2 


<7 


+ 


Z,1 


a 


Z, 2 


az,i<?z,2 


As the magnitude of the correlation coefficient approaches 1, the array capacity grows without bound 
except for the special case when \S\\/az,i = and the phase of p cancels the phase of 

This corresponds to the singular detection case in communications theory where the signal is recovered 
without error in the absence of noise. If the noise is uncorrelated (p = 0), the array capacity depends 
only on the sum of the channel SNRs, as noted above. If the noise is correlated but the signal is 0 in one 
of the channels (for example, |5 2 | =0), the array capacity reduces to 

+ |S 2 |=0 (19) 


This “noise-cancellation” result is independent of the phase of the correlation coefficient, implying that 
the complex noise samples in the two channels need not be phase aligned for perfect cancellation — in 
effect, the optimum weights defined in Eq. (10) ensure that the noise phases are properly aligned. The 
array capacity for the two-channel noise-cancellation case corresponding to Eq. (19) is shown in Fig. 3 
for several SNRs. 
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Fig. 3. Array channel for the two-channel case, ISgl = 0. 


B. Three-Channel Examples 

Next, we consider some three-channel examples, but only for the case of real signals and noise (in 
other words, here we ignore phase effects). We consider the triangular and the linear constellations as 
shown in Fig. 4, each having an array of three elements. We assume that the correlation coefficient of Zi 
and Zj, which is denoted by py, is geometrically related to the distance dy of array elements i and j as 
follows: 


Pa = P ldii{ (20) 

where max{— 1, a} < p < min{l, 6}, and a and b define the range [a, 6] C [—1,1] such that the covariance 
matrix ©z is legal, that is, ©z is non-negative definite and |©z| > 0. Also, 
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Fig. 4. Array configurations of three elements: (a) triangular configuration and 
(b) linear configuration. 


EZ^Z j — pij& (^ 1 ) 

The covariance matrices Q z and 6% of the triangular and linear constellations, therefore, are given by 


and 




"z,i 

perzperzp 

POz,lO-z,'i 


perzperzp 

perzperzp 


per z per z p 
per zperzp 

4,3 . 


e 2 


Z ~ 


per z per z p 

4,2 

Ip^erzperzp per z per z pi 


u zp 

perzperzp 


P erzperzp 
perzperzp 
4,3 . 


( 22 ) 


(23) 


By substituting Q z and 0| into Eq. (8), we can evaluate the array channel capacities of the above array 
constellations as a function of p. We consider two cases: channels having different SNRs and channels 
having the same SNR. 

1. Channels Having Different SNRs. Let er 2 xi = l,cr X2 = 1, and er X3 = 9; let o zl = 
0.16, a\ 2 = 0.49, and cr| 3 = 1.44. The array channel capacities of the two constellations as a function of 
p are given in Fig. 5. We observe that the array channel capacities approach infinity when p approaches 
—0.5 and 1.0 for the triangular constellation (when |©^| = 0) and when p approaches —1.0 and 1.0 for 
the linear constellation (when |0|| = 0). The channel capacities of both constellations are the same 
(3.2 bits/channel use) at p = 0, and they are lower bounded at 2.75 bits/channel use for this example. 
Thus, the maximum degradation due to noise correlation is only 0.45 bits/channel use. 

2. Channels Having the Same SNR. Let cr x l = 1.0, er 2 X 2 = 1.0, and er X 3 = 0.25; let a z l = 
0.16, cr| 2 = 0.16, and <r| 3 = 0.04. The array channel capacities of the two constellations as a function 
of p are given in Fig. 6. Again we observe that the array channel capacities approach infinity when p 
approaches —0.5 for the triangular constellation and when p approaches —1.0 for the linear constellation. 
However, both channel capacities approach 1.43 bits/channel use for the real signal and noise case, the 
channel capacity of a single channel, when p approaches 1.0. This is apparent from the fact that the 
receiver actually sees three scaled copies of the received signal plus noise, and this is equivalent to looking 
at one channel alone. 

The observations described in the two examples conform to our intuition, and the general formula 
given in Eq. (8) predicts all these observations. 
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Fig. 5. Array capacity of Gaussian channels: 
different SNRs. 



Fig. 6. Array capacity of Gaussian channels: 
same SNR. 


VI. Summary 

The capacity of an array of n Gaussian channels has been derived. The array channel was modeled 
as n observations of a single source, in the presence of additive Gaussian noise. It was shown that an 
optimally weighted sum of the array outputs achieves the same channel capacity as the array channel. 
Several examples of two- and three-channel arrays were discussed, and graphs of channel capacities were 
provided to illustrate array capacity as a function of noise correlation as well as display examples of 
singular behavior. 


References 


[1] T. Cover and J. Thomas, Elements of Information Theory, New York: Wiley 
Interscience, 1991. 

[2] V. Vilnrotter, E. Rodemich, and S. Dolinar, “Real-Time Combining of Resid- 
ual Carrier Array Signals Using ML Weight Estimates,” IEEE Trans. Comm., 
vol. 40, no. 3, pp. 604-615, March 1992. 

[3] T. Anderson, An Introduction to Multivariate Statistical Analysis, New York: 
Wiley, 1958. 


60 





[4] L. E. Brennan and L. S. Reed, “Theory of Adaptive Radar,” Array Process- 
ing Applications to Radar, ed. S. Haykin, Stroudsburg, Pennsylvania: Dowden, 
Hutchinson and Ross, Inc., 1980. 

[5] S. P. Applebaum, “Adaptive Arrays,” Array Processing Applications to Radar, 
ed. S. Haykin, Stroudsburg, Pennsylvania: Dowden, Hutchinson and Ross, Inc., 
1980. 

[6] H. H. Tan, R. Liang, and P.-H. Suen, “Optimum Combining of Residual Carrier 
Array Signals in Correlated Noises,” The Telecommunications and Data Acquisi- 
tion Progress Report 42-124, October-December 1995, Jet Propulsion Laboratory, 
Pasadena, California, pp. 33-52, February 15, 1996. 

http: / /edms- www .jpl.nasa.gov/tda/progress_report/42-124/124K. pdf 


Appendix 

Derivation of Optimum Combining Weights 


The following is a simple derivation of the optimum combining weights that maximize the SNR. Other 
derivations can be found in the adaptive signal processing literature [4,5] and in this issue [6], 

We start with a derivation for the uncorrelated noise case (diagonal covariance matrix). It is shown in 
[2], using the Cauchy-Schwarz inequality, that for the uncorrelated case the optimum combining weight 
vector JJ_ is proportional to 


u = (eD 1 vy 


where V_ is a signal vector, 0£» is a diagonal matrix with components a\ i , and O^, 1 is its inverse. When 
the weight vector U_ is applied, the combined SNR is maximized, achieving its upper bound, 


SNR max = v^e- l v 


Next, consider a correlated Gaussian noise vector with covariance matrix © z , and let D be a unitary 
matrix that diagonalizes © z . With the conjugate transpose of D, D~ 1 its inverse, and £>t = D~ l 
(unitary), we can write the diagonal covariance in terms of © z and D as 


©d = dq z d t 


Thus, D rotates vectors from the uncorrelated into the correlated reference frame, without changing their 
length, while its inverse rotates them in the opposite direction. Let S_ = D~ l V_ be a signal vector, and let 
W = D~ l U_ be a weight vector in the correlated frame. The optimum weights can be written in terms of 
© z and S as 
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u = (e-^vy = (D*e* z D^)~ 1 v* 


= D^sy 


This is the optimal weight vector in the uncorrelated frame, in terms of © z x and S_. Applying the inverse 
rotation operator £> _1 , we obtain the optimum weight vector in the correlated frame as 

w = d-'u = zr ^(e^s)* = (©^s)* 
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